Proceedings of the 30th Pacific Asia Conference on Language, Information and Computation, PACLIC 30, Seoul, Korea, October 28 - October 30, 2016
Authors
Abstract
The German Grammar group develops a fully formalized and computer-processable set of grammars that share a set of constraints, that is, they have a common core (see also Müller, 2013 for an overview). Some very general constraints hold for all grammars, some for subgroups of languages. Currently we work on:

1. German (Germanic, SFB 632, A6, Müller, 2007; Müller and Ørsnes, 2011),
2. Danish (Germanic, DFG MU 2822/2-1, Ørsnes, 2009; Müller, 2009; Müller and Ørsnes, 2011; Müller and Ørsnes, In Preparation),
3. Persian (Indo-Iranian, DFG/ANR MU 2822/3-1, Müller, 2010; Müller and Ghayoomi, 2010),
4. Maltese (Semitic, Müller, 2009),
5. Mandarin Chinese (Sino-Tibetan, DFG MU 2822/5-1, Lipenkova, 2008; Müller and Lipenkova, 2009),
6. Spanish (Romance, SFB 632, A6),
7. French (Romance, SFB 632, A6),
8. Yiddish (Germanic, Müller and Ørsnes, 2011) and
9. Hindi.

The approach to developing the core grammar is bottom-up in that we do not assume a genetically determined Universal Grammar and try to prove its existence in language after language. Rather, we treat every language in its own right and try to generalize over sets of languages only later. Some of this knowledge might be part of a UG in the above sense, but we do not make any claims on this issue.

We also do not make an explicit core-periphery distinction while working on individual languages. Rather, what belongs to the core is determined empirically by comparing languages. If we find a phenomenon in more than one language and think that it is correct to describe the phenomenon by the same means, the respective representations are kept in one file that is used by the respective grammars. This results in a grouping of languages that share the same code, with files containing very general constraints being used by all languages.
Similar resources
The Significance of Background Information in Acceptability Judgements of Korean Sentences
Linguists in the Generative Linguistics tradition typically rely on their own intuition as a native speaker for their core data. This practice has continually been challenged to be more rigorous in their data collection methodology, and Experimental Syntax is an effort to address the issue. The purpose of this presentation is to discuss whether some background information of the subjects might ...
The Challenge of Simultaneous Speech Translation
Simultaneous speech translation attempts to produce high quality translations while at the same time minimizing the latency between production of words in the source language and translation into the target language. The variation in syntactic structure between the source and target language can make this task challenging: translating from a language where the verb is at the end increases laten...
The grammar and semantics of disjuncts in World Englishes
Adverbs have become the ragbag in grammar in which all uncategorized items are relegated. Over the years, there have been several studies (e.g., Biber et al., 1999; Halliday, 1994; Hasselgard, 2010; Huddleston & Pullum, 2002; Quirk et al., 1985; Sinclair, 1990) that looked into the syntactic and semantic functions of adverbs. This paper focuses on what Quirk et al. (1985) call ‘disjuncts’ (whic...
Inferring Methodological Meta-knowledge from Large Biomedical Corpora
Large amounts of biomedical corpora have emerged from different sources, including scientific literature, lab notes, patents and electronic health records. Most of the efforts in biomedical text mining have focused on the extraction and linkage of specific facts, such as molecular interactions, links between genes and diseases, or patients’ symptoms. Such facts are rarely contextualised using t...
The CoreGram Project: Theoretical Linguistics, Theory Development and Verification
The German Grammar group develops a fully formalized and computer-processable set of grammars that share a set of constraints, that is, they have a common core (see also Müller, 2013 for an overview). Some very general constraints hold for all grammars, some for subgroups of languages. Currently we work on: 1. German (Germanic, SFB 632, A6, Müller, 2007; Müller and Ørsnes, 2011), 2. Danish (Ge...
Publication date: 2016